# Structured information extraction
PP DocLayout L
Apache-2.0
PP-DocLayout-L is a high-precision document layout region localization model based on the RT-DETR-L architecture, supporting the detection of 23 common document layout categories.
Text Recognition Supports Multiple Languages
P
PaddlePaddle
285
0
PP DocLayout Plus L
Apache-2.0
PP-DocLayout_plus-L is a high-precision document layout area positioning model, trained based on the RT-DETR-L architecture, and supports the detection of 20 common document elements.
Text Recognition Supports Multiple Languages
P
PaddlePaddle
1,308
0
Nuextract 2.0 4B
MIT
NuExtract 2.0 is a series of multimodal models specifically trained for structured information extraction tasks. It supports text and image inputs and has multilingual processing capabilities.
Image-to-Text
Transformers

N
numind
272
3
Phi 3 Mini 4k Instruct Graph
Phi-3-mini-4k-instruct-graph is a fine-tuned version of Microsoft's Phi-3-mini-4k-instruct, specifically designed for entity relationship extraction from general text data, aiming to achieve comparable quality and accuracy to GPT-4 in generating entity relationship graphs.
Knowledge Graph
Transformers English

P
EmergentMethods
524
44
Teenytinyllama 160m Text Simplification Ptbr
Apache-2.0
A compact language model specialized in Portuguese text compression and structured processing, trained on 330,000 Portuguese texts.
Text Generation
Transformers Other

T
cnmoro
50
2
Donut Base Medical Handwritten Prescriptions Information Extraction
MIT
A model based on the Donut architecture for extracting structured information from medical handwritten prescriptions
Text Recognition
Transformers

D
Humayoun
18
1
Featured Recommended AI Models